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ABSTRACT 

Space-based microlens parallax measurements are a powerful tool for under¬ 
standing planet populations, especially their distribution throughout the Calaxy. 
However, if space-based observations of the microlensing events must be specif¬ 
ically targeted, it is crucial that microlensing events enter the parallax sample 
without reference to the known presence or absence of planets. Hence, it is vital 
to dehne objective criteria for selecting events where possible and to carefully 
consider and minimize the selection biases where not possible so that the hnal 
sample represents a controlled experiment. 

We present objective criteria for initiating observations and determining their 
cadence for a subset of events, and we dehne procedures for isolating subjective 
decision making from information about detected planets for the remainder of 
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events. We also define procedures to resolve conflicts between subjective and ob¬ 
jective selections. These procedures maximize planet sensitivity of the sample as 
a whole by allowing for planet detections even if they occur before satellite obser¬ 
vations for objectively-selected events and by helping to trigger fruitful follow-up 
observations for subjectively-chosen events. This paper represents our public 
commitment to these procedures, which is a necessary component of enforcing 
objectivity on the experimental protocol. 

Subject headings: gravitational lensing: micro 


Introduction 


1.1. Measuring the Distances to Microlensing Planets 

While more than 6000 planets (and strong planetary candidates) have been found within 
about 1 kpc of the Sun (the great majority discovered via the transit and radial velocity 
techniques), there are only a handful of conhrmed planets with known distances that are 


greater than 4 kpc and only one conhrmed planet in the Galactic bulge (Batista et ah 


2014). All of these distant planets were found using gravitational microlensing, and in 


most cases the distances were determined using the “microlens parallax” technique (Gould 


1992). Microlensing would therefore appear to be the most natural method to measure the 


Galactic distribution of planets, i.e., to determine planet frequency as a function of Galactic 
environment. Such a measurement would provide important constraints on planet formation 


theories. For example, Thompson (2013) has suggested that gas-giant formation may have 


been inhibited in the Galactic bulge due to the high intensity of ambient radiation during 
the main epoch of star formation. 

However, while roughly half of the ~ 30 published microlensing planets have measured 
distances, this sample is heavily biased toward nearby systems. The reasons for this are well 
understood and are closely related to the general biases in astronomy toward nearby objects. 


First, nearby lenses have larger lens-source trigonometric parallaxes, TTrei = AU(D 
which gives rise to larger microlens parallaxes 
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where is the lens-source relative proper motion (in either the heliocentric or geocentric 
frame), 9-^ is the angular Einstein radius, and M is the lens mass. As explained in some 


detail by Gould & Horne (2013), the magnitude of tte quantihes the amplitude of the parallax 
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distortion on the microlens light curve, so that all other things being equal, larger tte implies 
easier detection. The most common method for measuring microlens parallax has been 
to observe the effect of Earth’s acceleration on the light curve (so-called orbital parallax). 
However, for typical Einstein timescales ts ~ 20 day, this effect is quite modest. This means 
that in addition to nearby lenses and low mass lenses, one is biased toward abnormally long 
duration events. It is difficult (though probably not impossible) to quantify these biases, but 
the main problem is that due to these biases, there are simply no microlens planets in the 
Galactic bulge with measured microlens parallaxes. Indeed, the one conhrmed Bulge planet 
had its distance measured by other means. 


This brings us to the other method of measuring lens distances: direct detection of the 
lens. The main difficulty is that the lens is superposed on a (usually) substantially brighter 
source, and remains so for typically a decade or more after the event. If the lens is sufficiently 
bright, then it is possible to directly detect it by measuring the combined source and lens 
light using high-resolution imaging (adaptive optics or Hubble Space Telescope (HST)) and 
subtracting out the source contribution, which is known from the light curve model. This, in 
fact, is how the distance to the only planet known to be in the Galactic bulge was measured 
(MOA-2011-BLG-293Lb; Batista et al.||2014 ). At the present time, this method is primarily 
limited to lenses that are at least 15% as bright as the source: otherwise the excess light due 
to the lens cannot be reliably detected. Hence, it is biased toward luminous (i.e., massive) 
and nearby lenses. 


The alternative is to wait until the source and lens separate due to their relative proper 
motions (typically a few mas yr“^) and can be individually resolved. Again, this method is 
more easily applied to brighter lenses and with current facilities, one must wait ~ 10 yr for 
the source and lens to separate sufficiently. When the next generation of 30 m telescopes are 
available, it will be applicable to much fainter lenses because these will separate sufficiently 
from the sources to be resolved within a few years due to their relative proper motions 


(Alcock et ah 2001; Gould 2014; Bennett et al. 2015 Batista et ah 2015 Henderson 2015) 


Therefore, the only path at present to routinely measure the distances to lenses (espe¬ 
cially faint lenses), and hence to measure the Galactic distribution of planets, is via space- 
based microlens parallaxes. In this approach, one observes a microlensing event simultane¬ 
ously from Earth and from a satellite in solar orbit, and derives tte from the difference in 


the two light curves (Refsdal 1966). There are some challenges to this method (over and 
above the problem of gaining routine access to such a satellite). First, the results are subject 


to a four-fold degeneracy in tte, including a two-fold degeneracy in tte. However, Galchi 


Novati et al. (2015) showed that it is possible in practice to break this degeneracy in the 
great majority of cases. Second, tte does not by itself yield distances and masses. Rather 
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this requires knowledge of 6 *e, 


TTrel = 6'E7rE, M = 


9 , 


KTTe 


( 2 ) 


and of the source parallax vr^ (tte = vTi-ei + vr^), although the latter is usually known quite 
adequately. Fortunately, 6 *e is usually measured for planetary events because the normalized 
source size p = 6^/6-e. can usually be measured from the source crossing of the planetary 
caustic, while the angular source size 9^ is almost always known from its color and magni¬ 
tude. Moreover, even for non-planetary (and non-binary) events, which generally lack such 
crossings, the lens distance (and so mass) can usually be estimated quite well from the mea¬ 
sured tte and kinematic arguments (Calchi Novati et al.||2015). Finally, for the case that the 


source proper motion can be measured, this estimate becomes even more accurate (Yee et 


al. 2015). 


Hence, as shown by Calchi Novati et al. (2015), one can obtain an accurate estimate 


of the cumulative distribution of lens distances from a given sample, and can in principle 
compare this to the cumulative distance distribution of detected planets. 


1.2. Spitzer and the Galactic Distribution of Planets 

To determine the Galactic distribution of planets, however, the detected planets must be 


compared to the underlying distribution of planet sensitivities, not simply of events. Calchi 


Novati et al. 

(2015 

) did not attempt to do this because there was only one planet in their 

sample ( 

Udalski et al.j2015 

), making a meaningful comparison impossible. The small number 


of planet detections was rooted in the nature of the observing campaign, which was a 100-hr 
“pilot project” to determine the feasibility of making such microlens parallax measurements 
using Spitzer. Thus, the Spitzer observations were limited to the subset of events judged 
most likely to yield tte, and no special effort was made to hnd planets within these events 
via, for example, intensive follow-up observations. 


Calchi Novati et al. (2015) argued, nevertheless, that it would be possible to estimate 


the cumulative distribution of sensitivities, simply by measuring the sensitivity of each event 


in the standard fashion (Rhie et al. 2000 Gaudi & Sackett 2000 Gaudi et al. 2002) and 


multiplying these sensitivities by the distance distributions in their Figure 3, even though 
the selection function of the events was unknown (and probably unknowable). This argument 
rested critically on the fact that the events were monitored from the ground and chosen for 
Spitzer observations without regard to the presence of absence of planets. This is a very 


similar argument to the one made by Gould et al.| ( 

2010 

) in the hrst study to derive planet 

frequencies from microlensing planet detections. 

Galchi Novati et al. 

(2015 

) further argued 



































that their sample could be concatenated with future space-based samples, regardless of 
whether these were carried out using Spitzer or other satellites such as Kepler, and regardless 
of whether the selection function was the same or different, known or unknown. The only 
proviso was that, as with the Gould et al. (2010) and Calchi Novati et al. (2015) samples, 
the events were monitored without regard to the presence or absence of planets. 


1.3. This Paper 








Table 1 . Glossary of terms 
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The goal of the Spitzer microlensing parallax program (and indeed any space-based 
parallax program) is to create a sample of events with well-measured parallaxes. If these 
events are observed by the satellite without regard to whether or not they have planets, 
the hnal sample can be used to determine the Galactic distribution of planets, e.g., by 
comparing the frequency of planets in the Galacitic bulge with the frequency of planets in 
the disk. Hence, achieving this scientihc goal has three primary considerations. First and 
foremost, the decision to select an event for Spitzer observations must be independent of any 
knowledge of the presence or absence of a planelQ Second, these observations must lead to 
a measurable parallax. Finally, maximizing the constraints on the Galactic distribution of 
planets requires maximizing not only the number of planets detected but also the range of 
planets that could he detected (i.e. the planet sensitivity), since the detection efficiency is a 
crucial component to any measurement of the planet occurrence rate. 


The primary goal of the present paper is to determine a strategy to monitor events with 
Spitzer to ensure the hnal sample of events with parallaxes is monitored without reference 
to the presence or absence of planets, because this is the property of the sample that is 
most difficult to control. At the same time, this strategy is driven by the additional goals 
of maximizing both the planet sensitivity of the monitored events and the likelihood of 
measuring parallaxes. By dehning this strategy in advance of the observations, we can 
create a sample of events with measured parallaxes with maximal leverage for measuring the 
occurrence rate of planets as a function of Galactic distance. 


We begin, in Section with a general discussion of how events may be selected, either 
objectively or subjectively, and how that selection affects the resulting planet sensitivities of 
those events. Since much of that discussion is guided by planet sensitivity and the practical 
considerations of the Spitzer campaign, the reader may also wish to refer to Sections 
and 5.2 Sections [3]-[^ cover the various ingredients necessary to dehne criteria for selecting 
events, namely the planet sensitivity and the probability of measuring parallax. Then in 
Section we formally dehne objective criteria for the Spitzer campaign to select events 
and also determine their observing cadences. Section then discusses specihc guidelines 
for subjectively choosing events for this campaign, and Section specihes how the available 
observations will be distributed amongst the targets. Finally, we give a brief summary in 
Section We have provided a glossary of terms in Table to clarify some qualitative 
statements we may use and also colloquialisms that have arisen in microlensing. 


^While it is more natural to think about the presence of planet creating bias in the sample, the absence of 
planets is also important. For example, if an event does not show evidence of a planet, it could be excluded 
from selection for parallax measurements because it is “uninteresting.” Hence, the absence of planets has 
similar potential to create bias in the final sample but in the opposite direction as the presence of planets. 
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2. Objective vs. Subjective Selection Criteria 


There are many choices that must be made with respect to Spitzer observations of any 
individual event. One must decide when to begin making such observations, when to commit 
to the target]^ with what initial cadence, whether and when to change this cadence, and 
whether and when to halt the observations. This entire chain must be carefully established 
in order to ensure the fundamental requirement that the observational sequence be indifferent 
to the presence or absence of planets. Table gives a brief overview of the relevant decision 
points, and Section 5^ discusses how the specihcs of Spitzer operations set the quantitative 
dehnitions. 


The starting point is the choice to begin monitoring an individual event, i.e., “triggering” 
observations. This choice can be made either because the event meets some objective criteria 
(in which case the “choice” is automatic) or according to some subjective criteria of the 
team organizing the observations. Table summarizes the various channels through which 
observations may be triggered. However, all other decisions about the monitoring are heavily 
influenced by the hrst dichotomy (objective vs. subjective), so we divide the discussion 
according to it. As we will describe, optimal event selection requires a combination of 
objective and subjective selection. Because this is so, one must also decide what to do in 
advance if the objective and subjective selection procedures collide. That is, what should be 
done if an event is selected subjectively, but later meets the objective criteria for selection. 

We discuss the architecture of the selection procedure before discussing the criteria 
themselves because the architecture is both non-trivial and logically independent of the 
criteria. Within the framework of this discussion, one must keep in mind that the overall 
goal is to maximize the sensitivity of the experiment to planets and that planet sensitivity 
rests primarily on ground-based observations (see Section]^. At the same time, after an 
event is selected, its entry into the hnal sample to measure the Galactic distribution of planets 
requires that its parallax is measured, which depends primarily on Spitzer observations. 

In the following sections, we occasionally give examples to illustrate the points under 
discussion. For these examples, it may be helpful to keep in mind that some of observables 
that affect planet sensitivity and parallax measurements include the time of the event peak to, 
the magnihcation of the event (larger is better), and the magnitude of the event as seen from 
the ground or from Spitzer (brighter is better). These observables and their relationship to 
planet sensitivity and parallax are discussed in detail in Sections and The hnal criteria 


^The distinction between deciding to observe a target and committing to a target is discussed in Sec¬ 


tion 


2.2 





Table 2. Definitions of Relevant Times 
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Table 3 . Ways to Select an Event 
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are given in Section 

2.1. Objectively Chosen Events 

2.1.1. Objective Selection 


The great advantage of choosing events by objective criteria is that any planet that is 
discovered during an event that is so chosen can be included in the sample, and similarly, 
the planet sensitivity of the event over its entire duration can be included in the analysis as 
well. For example, suppose an event is announced by a survey group on May 1 but Spitzer 
observations cannot begin until June 8. The event undergoes a planetary deviation on May 
15, peaks on May 28, and on June 3 is scheduled for Spitzer observations beginning June 
8 because it is found to meet previously chosen objective criteria. Then the planet can be 
included in the sample, even though it was discovered before the Spitzer observations began, 
and even before it was known that it would eventually satisfy the objective conditions that 
triggered observations. 


By contrast, in the absence of such criteria, the event could have been selected for Spitzer 
observations subjectively. In that case, neither the previously discovered planet, nor the 
planet sensitivity from the the entire pre-decision period, could be included in the analysis. 
Otherwise, the presence or absence of the planet could influence the event’s “selection” (i.e., 
inclusion in the hnal sample with measured parallaxes; Section 5.1 discusses what is meant 
by a “measured parallax”). 


2.1.2. Objective Cadences 

A large fraction of objectively chosen events will be similar to the hypothetical one 
described above in that most of their planet sensitivity will be past at the time that the 
Spitzer observations begin. Therefore, it is absolutely essential that the cadence be chosen 
objectively as well. In order to enter the sample, the event must have a measured parallax. 
If the cadence is not chosen objectively, events with planets could receive extra observations 
to help ensure they have measured parallaxes. We will discuss specihc algorithms to make 
this choice in Section |6l 

Finally, we will just mention that to avoid wasting observations, there must also be a 
mechanism for halting this objectively-determined Spitzer observation schedule when these 
observations are no longer useful. However, the only permitted reasons for doing so are that 
the microlens parallax has been measured or that the event (as seen from Spitzer) has already 
returned to baseline (i.e., is now essentially unmagnihed), so that no further improvement is 
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possible. 


2.2. Subjectively Chosen Events 
2.2.1. The Need for Subjective Selection 


At first glance, the advantage conferred by the objective approach appears to be so great 
that one might wonder why one would consider the subjective approach at all. The main 
problem is it is impossible to dehne objective criteria loosely enough to capture all events 
of interest without at the same time introducing a large number of events either with poor 
planet sensitivity or a low probability of yielding a parallax measurement (“bad” events). 
Hence, the objective criteria must be strictly dehned so that all of the selected events are 
both highly sensitive to planets and have a high likelihood of yielding a parallax (“good” 
events). Otherwise, a large amount of observing time will be wasted on events of little value. 

A second issue is whether or not an event chosen objectively will yield a parallax mea¬ 
surement. The central difficulty is that the event’s objectively-chosen Spitzer observational 
sequence must yield a parallax measurement. It can be quite difficult to choose such obser¬ 
vational sequences based solely on objective criteria, or to determine which events might be 
worth the observational effort to obtain parallaxes, or even to determine which might yield 
parallax measurements with any sort of effort. These difficulties can all be more effectively 
addressed by subjectively choosing the event, in which case one can also choose a cadence 
(or cadence algorithm) that is individually tailored to that event. For example, suppose that 
it is known that an event will meet objective criteria in 2 weeks, but this will allow for only 
1 week of Spitzer observations. If we wait to start observations until this date, we risk the 
possibility that 1 week of data will be insufficient to measure a parallax, in which case the 
entire event and its sensitivity is lost. In contrast, we could select the event subjectively 
now to get 3 weeks of Spitzer observations and vastly improve the probability that those 
observations will yield a parallax. 


Hence, there are two reasons that events might be chosen subjectively. First, because the 
objective criteria cannot capture all events of interest. Secondly, because an earlier subjective 
trigger may make the difference between measuring a parallax or not. As we discuss in the 
next section, not much planet sensitivity is lost by subjectively choosing events. We will 
also discuss in Sections 2^ and |2.4| the resolution of conflicts for events that may be selected 
both subjectively and objectively. 
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Fig. 1.— Example fits to simulated event data. The dashed line shows the underlying point 
lens {uo = 0.2, = 20 days) from which the data were generated with Gaussian errors. The 

right-hand panels show data sampled 10 times/night, every night, and the left-hand panels 
show the same data, but thinned to 1 observation every 2 nights. Starting from the top, the 
rows show the approximate l-cx range of hts (solid lines) to the data at f = 18,15,12, 5 days 
before the peak and at f = 3 days after the peak. These hgures show that the hts to the 
high-cadence data (10 obs/night) are much better at capturing the true, underlying behavior 
of the events, whereas hts to the sparse data (0.5 obs/night) often give misleading hts that 
do not encompass the true event. 


0.5 obs/night 10 obs/night 






















15 


2.2.2. Subjective Selection 


If, for whatever reason, an event fails to meet the objective selection criteria, the team 
may decide to observe it anyway. The reasons one might want to do this are discnssed in 
some detail below, bnt hrst we focns on the conseqnences of this decision, which leads to 
three types of snbjective selection as dehned in Table 

Actnally, the key decision is not whether to begin Spitzer observations but when to 
(publicly) commit to a schedule of S'pzteer observations. Once such a decision is made, it must 
be accompanied by a public commitment of an observational sequence (or to an objective 
algorithm for determining that sequence). Otherwise, one could choose events without any 
knowledge of whether they would later show planets, discover that they do indeed host 
planets, and then be biased to observe them more frequently in order to preferentially increase 
the probability that their parallax will be measured, thus placing them in the sample for 
measuring the Galactic distribution of planets. 

In the case of subjective decisions, all planets that show up after the public commitment 
date (tcom), as well as all sensitivity to planets after this date, would be kept in the analysis, 
while all planets and planet sensitivity from before this date must be excluded. 


We illustrate the need to separate the decision to observe from the commitment to 
observe with two examples. 

First, suppose that at a given Spitzer decision time (typically Gei =Monday, see Sec¬ 


tion 5.2 for details of the logistical constraints on Spitzer observations) an event that has 


not yet peaked has an ambiguous future, with some chance that it will rise in magnihcation 
sufficiently to have good sensitivity to planets and to enable a viable parallax measurement 
(e.g., panel 2a of Figure [^. However, as of Monday, this cannot be established with any 
conhdence, although ground-based data are likely to resolve this ambiguity with a few days. 
Based on this assessment, the team decides to observe this event once per day during the 
week beginning at the next upload three days hence, i.e., fj,next =Thursday. Even though the 
future of the event is uncertain, preemptive observations (rather than waiting for the next up¬ 
load cycle, one week hence) could make the difference between a good parallax measurement 
and a meaningless upper limit. This is especially true since the bpzteer observations can only 
be updated once per week. Then on Sunday, ground-based data show that this event has 
risen sufficiently that its future behavior can be predicted well enough to determine that it is 
interesting target (e.g. panel 2c of Figure as compared to panel 2a). The team then an- 


an 


nounces that it is committed to monitoring this event and also announces its chosen cadence 
(or objective procedure for determining such cadence). All planets and planet sensitivity 
from team =Sunday forward can then be included in the analysis. Similarly, only Spitzer 
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data beginning on Snnday can be nsed in the initial parallax measnrement measnrement 
that determines whether the event enters the sample. Even thongh the Thnrsday-Satnrday 
observations cannot be nsed, this preemptive decision to observe has resnlted in extra ob¬ 
servations (Snn-Wed) that can be used to improve the parallax measurement as compared 
to waiting until the following Thursday to begin Spitzer observations. With regard to the 
specihc role of Spitzer data in planet sensitivity and discovery: if the team has seen the hrst 
few days of Spitzer data prior to the Sunday commitment, then these cannot be included, 
but if they have not yet seen them, these can be (since in this case they would not have 
influenced the decision). 

For a second example, consider the same case as above but with the event peaking 
at low magnihcation, and hence having both low sensitivity to planets and low likelihood 
of a measurable parallax (e.g., the data instead follow the solid black line in panel 2a of 
Figure [^. The team then decides not to continue monitoring this event. Because the team 
never committed to the event, they have no obligation to continue monitoring it, and so it 
is entirely dropped from the sample. By the same token, any planets discovered from this 
event cannot be included in the analysis. In contrast, if the decision and commitment were 
the same process, in order to avoid bias, the team would be required to continue monitoring 
an event that it recognized as worthless. 

In the hrst example given above, the public commitment to observe the event was made 
after the Spitzer observation sequence was uploaded to the spacecraft and indeed after those 
observations began (fcom > tynext > tsei)- This is the situation described as “Subjective, 
secret” in Table However, the same principles apply to, for example, an event that is 
newly recognized a few days before the upload decision and is already recognized to be 
promising (e.g. panels 2c or 2d of Figure [^. The team could publicly commit to observing 
this event immediately with specihed cadence (i.e. “Subjective, immediate” in Table [^, and 
then all planets and planet sensitivity from that date forward would be included (provided 
the parallax was measured well enough to put the event in the sample). 

Finally, in the hrst example, the team might publicly commit immediately to observe 
the ambiguous event, but with explicitly stated criteria for halting such observations (Table 
1^ “Subjective, conditional”). For example, it might specify that observations would be 
discontinued if the event failed to reach some specihed magnihcation before the time of the 
next upload. In this case, whether or not the observations were continued, the event might 
have sufficient Spitzer data to measure the parallax and enter the sample. If so, all planets 
and planet sensitivity from after tsei = tcom could enter the sample. 

At hrst sight, “Subjective, conditional” seems clearly superior to “Subjective, secret” 
because it enables inclusion of more planets and more planet sensitivity. However, this is 
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not always the case. In fact, it often happens that the uncertainty in predicting rising events 
includes not only to their time of peak and peak magnihcation, but extends even to when 
this knowledge will be reliably available. For example, consider the evolution of the fits to 
the data in column 1 of Figure [T| which illustrates how the fits may change dramatically 
due to small fluctuations in the data and may not capture the true, underlying behavior of 
the event. If the team cannot reliably predict the future course of the event, it may not 
be possible to correctly pre-define criteria for halting observations. This creates the risk of 
being committed to observing many bad events or being forced to halt observations of an 
event that turns out to be good, but with different parameters than initially supposed. 

Thus, as summarized in Table subjective decisions can take a considerable variety of 
forms. The only constraint is that they must be constructed to avoid the possibility that the 
presence or absence of planets detected after tcom will influence the cadence of observations. 


2.2.3. Subjective Cadences 

In contrast to objectively chosen events, the cadences for subjectively chosen events can 
be chosen by the team. However, they must be fully specified at the time of the commitment 
to observations (tcom)- As with objectively chosen events, after fcom the only permitted 
reasons for halting the scheduled Spitzer observations are that the parallax has already been 
measured or the event has returned to baseline (as seen from Spitzer). 


2.3. Collisions: Subjectively Chosen Events that Meet Objective Criteria 

An event that has been subjectively chosen may, at a later date, meet the objective 
criteria. In this case, the objective selection of the event must take precedence. Otherwise, 
there is no point in having objective criteria since they could always be subjectively overrid¬ 
den. The objective criteria will specify an objectively-determined cadence of observations. 
The more frequent of the subjective or objective cadences will take precedence for future 
observations. However, only the observations taken after the objective selection and at the 
objective cadence can be used to determine whether the parallax measurement is adequate 
to enter the sample. If it does enter, then all planets that are detected and all planet sen¬ 
sitivity (including from before the commitment to the event, fcom) enter the analysis. The 
next section will clarify the reason for maintaining a higher, but subjective, cadence even if 
only a fraction of the data can be used for the objectively-measured parallax. 

Note that for these cases all planets discovered in Spitzer data (as well as all planet 
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sensitivity from these data) should be included in the analysis, regardless of whether these 
data were taken before or after or in response to the commitment to the subjective selection 
of the event. The only exception would be if the planet were detected purely in the Spitzer 
data and subsequent Spitzer observations were increased because of it (see Section 2.5). 


2.4. Reversions from Objective to Subjective 

If an event has 1) been chosen subjectively, 2) subsequently satished the objective criteria 
and thus triggered a conversion to being objectively-chosen, 3) fails to yield a parallax based 
on the objective portion of the Spitzer light curve, and 4) does yield a parallax based on all, 
post-commitment Spitzer data, then it automatically reverts to subjective status. In this 
case, only planets and planet sensitivity from the post-commitment part of the light curve 
can be included in the analysis. 

The basic reason is that none of the decisions made after this commitment depended in 
any way on the presence or absence of a planet (other than possible planets before this date, 
which must be excluded from the calculation of planet occurrence). 

2.5. Planet-Characterization Observations 


As discussed above, whether the decision to observe an event with Spitzer is objective 
or subjective, the cadence must be chosen without reference to the presence or absence of 
a planet. For the objectively-chosen events, this cadence must be determined objectively 
by pre-determined criteria. For subjectively chosen events, they must be announced at the 
times of the decision, i.e., prior to the discovery of any planets. 


However, once a planet is discovered, it can be important to increase the pace of ob¬ 
servations from the ground and/or Spitzer in order to improve the characterization of the 
planet. In this case, one can increase the observational cadence, but only the observational 
data that would have been taken under the pre-determined schedule can be used to assess 


the detectability of planets and the measurability of the parallax. See, for example, Yee et 


al. (2012). Application of this rule is straightforward for Spitzer observations, which is the 


main focus in this section, because, as we have specihed, the observational cadence is in 
fact pre-determined. The situation is more complicated for ground-based observations of the 
same event, to which we now turn. 


2.6. Ground-based Follow-up Observations 


The majority of microlensing planets published to date were discovered by a combination 
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of microlensing surveys that find large numbers of events, mostly with a low-to-moderate ca¬ 
dence of observations, and follow-up surveys that target individual events for more intensive 
monitoring in order to enhance the discovery and characterization of planets. In fact, survey 
groups sometimes go into “follow-up mode” by increasing the cadence in the survey field that 


contains a particularly interesting event ( 

Yee et ah 

2012 

) or even re-centering an existing 

field to incorporate a particular event ( 

Yee et ah 

2014 

). For present purposes, observations 


by specialized follow-up groups and survey teams in “follow-up mode” are equally considered 
as “follow up”. The only exception would be follow-up observations that are determined by 
purely objective criteria. 


Follow-up observations must be evaluated with respect to two questions: hrst, how do 
they impact planet sensitivity and planet detection, and second how do they impact the 
measurement of microlens parallax (and so entry into the sample). 

2.6.1. Follow-Up and Planet Sensitivity/Detection 


As with survey observations, follow-up observations contribute to planet sensitivity and 
detection for the entire duration of objectively chosen events and for all post-commitment 
observations of subjectively chosen events (which may be a time-span much longer than the 
window for Spitzer observations). Indeed, while the fundamental point of announcing sub¬ 
jective choices of events for Fpzfzer observations (i.e., the commitment to observe an event) 
is to establish a record of what planet sensitivity can be included, a major secondary goal is 
to encourage observations of these events, particularly those that are not well covered by the 
surveys, (see Section 4.1 for a general outline of standard survey strategies). Such subjective 
announcements automatically have the effect of encouraging follow-up observations because 
prior to such announcement, the planet detections can enter only if the event has already 
satisfied, or ultimately proves to satisfy, objective criteria. 


The only question is whether changes in the adopted cadence of follow-up observations 
due to the perceived presence of planets influences the detectability of the planets. This 
can happen in principle if the planet generates an observed perturbation (in either survey or 
follow-up data) that is strong enough to trigger interest but not, by itself, sufficient to conhrm 
the presence of the planet. In this case, follow-up observations aimed at characterizing the 
planet can make the difference between it being undetectable and being detectable. 


This issue is not particular to parallax experiments: it pertains to any experiment that 
aims to make a statistical statement about planets using microlensing follow-up data. For 
example, Gould et ah ( 2010[ ) noted that two of their six detected planets occurred in an 
event that showed an early (and in itself, not comprehensible) perturbation that ultimately 
proved to be due to a planet. This early perturbation did trigger additional observations, but 
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these subsided over the next few days. Observations only intensihed again when the event 
approached high magnihcation, which was the standard trigger for high-cadence follow-up 
observations. 

In brief, these issues arise in a minority of planetary events, and usually can be re¬ 
solved based on records of the decision making. While it is possible that there may be 
unresolved cases in the future, the importance of characterizing planets is too great to allow 
this possibility to interfere with aggressive follow-up response to the tentative detection of 
planets. 


2.6.2. Follow-Up and Parallax 


The other aspect that must be considered is the role of follow-up observations in mea¬ 
suring microlens parallaxes and so in putting individual events into the hnal sample used 
to measure the Galactic distribution of planets. There are two relatively distinct ways that 
this can happen. First, the planetary-induced features in the light curve may substantially 
increase the precision of the parallax measurement from ground-based observations alone and 
so make the difference between whether it is included in or excluded from the sample. Sec¬ 
ond, the follow-up observations may improve the precision of the non-planetary light curve 
parameters {to,uo,tE)® (i-e., the time of maximum, the impact parameter, and the Einstein 
timescale) and thus improve the precision of the parallax measurement that comes from the 
comparison with Spitzer data. We discuss these in turn. 


An & Gould (2001) argued that events with three peaks (features due to caustics in¬ 


duced by a companion) would gain signihcantly improved parallax measurements relative 
to otherwise similar point-lens events. However, in the intervening years, almost nothing 
has been done to investigate the role of the perturbations in the parallax measurements for 
detected planets. For example, early modeling showed that the immediate post-peak light 
curve of MOA-2009-BLG-266 yielded a surprisingly good parallax measurement, despite the 
fact that it is extremely rare for orbital parallax to be measurable before an event substan¬ 
tially returns to baseline. This was attributed to the sharp deviations in the light curve 
caused by a Neptune mass-ratio planet. Yet Muraki et ah (2011) say that the dominant 
source of the = 2789 parallax signal derives from the MOA data in the wings and that 
very little parallax signal comes from the perturbed region. However, we hnd from htting the 
MOA data alone (and excluding the perturbed region) that the parallax signal from these 
data is only Ay^ = 205, implying that the perturbation could in fact play a major role in 
the strength of the parallax signal, as An & Gould (2001) anticipated. 
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It is very likely that planetary perturbations play a significant role in the strength of 
the parallax signal in many other events as well. This is likely to be a partial explanation 
for the fact that roughly half of all microlens planets published to date have measured 
parallaxes. Although historically, there has been a lack of interest in where the parallax signal 
comes from, with respect to Spitzer observations, this question is of cardinal importance. If 
planetary events have more easily measured parallaxes than non-planetary events, then the 
sample of objects with measured parallaxes is biased. 


Hence, in determining whether the event enters the sample, it is essential to ask whether 
it would have a sufficiently well-measured parallax even if there had been no planet. This 
means both eliminating follow-up observations that were triggered by the presence of a 
planet and also (for this purpose) replacing the actual light curve with a fake, point lens 
light curve based on the event’s parameters (to,uo,tE)- This fake light curve could then be 
fit to determine the strength of the parallax signal from the point lens event. This procedure 
must also be applied in cases in which the planetary perturbation is seen from Spitzer. 


A closely related issue is that an incipient planetary anomaly might be misinterpreted as 
evidence for an approach to high magnification, and hence trigger an “honest” (i.e., seemingly 
non-planet-related) decision to observe the event, either ground-based follow-up observations 
or observations by Spitzer. This occurred, for example, for ground-based observations of the 
second microlensing planet, OGLE-2005-BLG-071 (Udalski et al. 2005). Such observations 
by Spitzer might enable a parallax measurement that would not have been possible for a 
point-lens event. This must be checked in all cases, which again can be done through the 
use of fake light curves. 


By contrast, it will be relatively rare for follow-up data to play a major role in the 
determination of the event’s point-lens parameters simply because the main features of the 
event that enter the parallax measurement usually derive from long-term observations and so 
are well measured from survey data. However, high-magnification events can be an exception, 
primarily because dense coverage of the peak is often required to determine the ground- 
based impact parameter, particularly if the surveys cover the field at low cadence. This was 


exactly the case for OGLE-2014-BLG-1049, one of the 21 events analyzed by Galchi Novati 


et al. (2015). However, in the great majority of cases (including this one), an improved 


parallax measurement is simply one of the benefits of conducting follow-up observations 
of parallax candidates. The primary motivation is generally the increased probability of 
detecting planets (Griest & Safizadeh 1998). The only exception would be if these critical 
near-peak observations were triggered by the known presence of a planet (rather than the 
hope of finding one). We expect such planet-triggered parallax-assisting measurements will 
be extremely rare and mention them primarily for completeness. 
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3. Principal Ingredients for Developing Selection Criteria 

At a fundamental level, there are only two considerations in deciding whether to choose 
one event over another for observations. First, how sensitive to planets is that event? 

Second, how likely is it that if Spitzer observations are undertaken, a microlens parallax 
will actually be measured? Thus we may schematically define a “quality factor” Q for the 
experiment 

= (3) 

i 

where Si is the planet sensitivity of the ith event chosen, and P* is the probability that 
it will yield a parallax measurement. Then the goal in developing selection criteria should 
be to maximize Q. One issue posed by Equation (|^ is that planet sensitivity is actually 
a function of planet properties. However, this is easily resolved by adding two additional 
indices to Si to specify these properties. We return to this complication in Section |4.3[ A 


more fundamental challenge is that both Si and Pi may be poorly known at the time that 
the decision must be made to initiate observations (c.f. Figure [^. The first step toward 
figuring out how to proceed in the face of these uncertainties is to review what makes an 
event sensitive to planets and how much about this can be known at any given stage in its 
evolution, and to then address the corresponding questions about the measurability of its 
parallax. 

4. Planet Sensitivities 

After an event is over, its “sensitivity” to planet^ can be rigorously defined as a function 
of two variables, the planet-star mass ratio q and the planet-star separation s (in units of 
6-e). See, for example, Gaudi & Sackett (2000), Gaudi et ah (2002), Gould et ah (2010), and 


Gassan et ah (2012). However, when choosing an event for additional observations while it is 


evolving, one must be guided by a more qualitative understanding of what properties make 
the event sensitive to planets and judge how likely it is that these will appear. 


4.1. Ground-based Microlensing Observations 


Because microlensing events depend on the chance alignment of two stars at radically 
different distances, for the most part, these events cannot be predicted in advance. Hence, 
microlensing surveys monitor millions of stars toward the Galactic bulge, where the stellar 
density, and therefore the microlensing event rate, is highest. If the survey team sees a star 


^In other planet-finding contexts, e.g. radial velocities, “planet sensitivity” is often referred to as “detec¬ 
tion efficiency.” 
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brightening in a manner consistent with microlensing, they issue an alert announcing a new 
microlensing event. 


Microlensing surveys monitor these helds at a variety of cadences 
(e.g. http://www.astrouw.edu.pl/$\sim$jskowron/ogle4-BLG/). “High-cadence” helds 
are monitored one to several times per hour (e.g. column 2 of Figure [^, which is sufficient to 
characterize small planetary signals from terrestrial-mass planets. “Moderate-cadence” helds 
are observed several times per night, which can capture planetary signals from ice giants. 
“Low-cadence” helds are monitored once per night or less than once per night (e.g. column 
1 of Figure and are generally focused on producing alerts of ongoing microlensing events 
that can then be monitored more intensively by follow-up groups, although these survey 
observations themselves are occasionally sufficient to characterize large (gas-giant) planets. 


In the context of this paper, we will focus on survey data from two sources. The pri¬ 
mary data will come from the OGLE-IV survey (Udalski et ah 2015b), whose sky coverage 
is given in the above URL. In addition, we will consider data from the new Korea Microlens¬ 
ing Telescope Network (KMTNet) helds, which will be conducting its hrst year of routine 
microlensing survey observations. Under the assumption that these observations are carried 
out, we will include in our evaluations data from the four core KMTNet helds, which will be 
observed many times per hour. 


4.2. Planet Sensitivity: Qnalitative Features 


Microlensing detects planets around the lens stars as perturbations to a standard (point- 
lens) microlensing event. That is, the microlensing event is overall dominated by the gravita¬ 
tional potential of the lens star (host), which splits the light into two images whose position 
and size change as a function of time. If a planet interacts with one of those images, it 
creates a perturbation that distorts the shape and total magnihcation of that image, which 


can lead to a detectable signal. See Gaudi (2012), in particular Figures 4 and 5. 


The total “planet sensitivity” of a given event depends on two factors. The hrst is the 
intrinsic sensitivity of the event to planets. Larger planets, and planets that are closer to the 
“Einstein ring” (circle with angular radius 6-^) are easiest to detect. The larger the images 
are, the more sensitive they are to planetary perturbations. Hence, more highly magnihed 
events are more sensitive to planets, i.e., to smaller planets and to planets that are farther 


inside or outside the Einstein ring (Gould & Loeb 1992). The most extreme example would 


be the high magnihcation events (peak magnihcation of the underlying point lens event 
H(to) = Hmax ^ 100) in which the images form an almost perfect ring that probes a wide 
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range of separations and is easily pertnrbed (Griest & Safizadeh 1998). 


The second factor affecting planet sensitivity is how well the data cover potential pertnr- 
bations. The qnality of this coverage is dehned by two principal characteristics: cadence and 
photometric precision. Regarding the hrst, planetary pertnrbations typically last between a 
few honrs and a few days, so a cadence that is a factor of ~ 10 more freqnent than that is 
necessary to characterize those pertnrbations. Since snch pertnrbations may occnr at any 
time dnring the event, the sensitivity will be greatest if the data are continnons dnring this 
period. However, if observing resonrces are limited, then restricting continnons observations 
to the most highly magnihed (hence most sensitive) parts of the light cnrve may be the most 
prodnctive approach. Nevertheless, planets can appear in all parts of the light cnrve, even 
after the main event is over or before it began (e.g., Poleski et al. |2014b ), so that observations 
are never “wasted”. 


Photometric precision is mainly governed by sonrce brightness. This factor therefore 
favors intrinsically bright sonrces, bnt highly magnihed sonrces can also be bright (jnst at 
the moment that they are most sensitive to planets). 

Even thongh we have not yet examined the other key determinant of Q (i.e., the proba¬ 
bility of measnring tte. Section]^, we can already draw a few general lessons from the above 
analysis of planet sensitivity. First, higher peak magnihcation is the best single indicator 
for choosing events (provided that the peak region can be intensively monitored from the 
ground). Second, for objectively chosen events, those that are in high-cadence snrvey helds 
are snbstantially more valnable than those that are not. This is becanse high-cadence events 
can yield planets long before the onset of Spitzer observations, or even before the event was 
recognized as microlensing, whereas low-cadence events generally cannot. Third, for events 
in low-cadence helds that achieve relatively high-magnihcation, it is important to mobilize 
follow-np observations prior to peak. If these are chosen snbjectively, then the desirability of 
choosing this target will depend critically on the expectation that snch follow-np observations 
will be carried ont. 


4.3. Planet Sensitivity: Quantitative Determination 


The question of quantifying the planet sensitivity Si of each event is mainly outside 
the scope of the present paper because this can only be done after the event is largely over 
and hence after all of the observing decisions that are the subject of this paper have already 
been made. That is, these decisions must be made on the basis of the qualitative indicators 
discussed above. Nevertheless, for completeness we present here a broad overview of the 
relevant issues. 

Planet sensitivity is measured as a function of two parameters (s, q) and so can be 
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formally written These are two of the seven basic parameters that are the minimnm 
needed to describe a planetary event. Three of the others are (to, Wo, ^e), he., the parameters 
of the nnderlying point lens event. The remaining two are the angle between the sonrce-lens 
trajectory and planet-star axis, a, and the ratio of the sonrce radius to the Einstein radius, 
p = 6'*/6'e- 

Historically there have been two approaches to determining planet sensitivity. In the 


hrst approach (Rhie et ah 2000), one constructs an ensemble of planetary light curves that 


vary in a but are hxed in the remaining six parameters. The values of (to, wq, ^e) are adopted 
from the best £t of the single-lens event. We address the choice of p further below. The 
remaining two parameters are just those being tested (s, q). For each light curve, one creates 
fake data points at the times of each of the real measurements, with values equal to those 
predicted by the model and error bars equal to the those of the real data points. With the 
adopted parameters, the £t to a fake curve without a planet would be “perfect”, i.e., = 0, 

so any in excess of this value must be due to the planet. One then hts these fake data 
to a point-lens model. Therefore, if the is above some threshold (perhaps > 200 for 
events with moderate magnihcation, Hmax < 100, Yee et al. 2012) then the planet is said to 


be detectable. The fraction of all the a at hxed (s, q) for which the planet is detectable is 
then said to be the sensitivity . 


In the other approach (Gaudi & Sackett 2000), one hts the actual data with planetary 
models with the same sampling of parameters, and measures the Ay^ improvement between 
the planetary model and the best-ht point lens model. This method is more time-consuming 
but has the advantage of simultaneously searching for all planets that may be lurking in the 


data down to the adopted threshold (Gaudi et al. 2002). 


The choice of p is a subtle one. For planetary events, one often measures p from the 
smearing out of the light curve as the source passes the sharp edge of a caustic. Hence, 
when constructing fake planetary light curves, one must insert some value of p even though 
this quantity is very rarely measured in point-lens events. The problem is that while 9^ is 
usually well-determined from the color and magnitude of the source (e.g., Yoo et al. 2004), 


is not known. We do not review the various methods used to estimate p in the past but 
simply note that for Spitzer events, 6 *e = 'n'rei/'^E is usually known quite well because tte is 
measured and vTrei is well constrained ( Galchi Novati et al.||2015 ), allowing a well-constrained 
estimate of p = 6**/6 *e. 


Gurrent microlensing experiments have far too few detections to constrain the full two- 
dimensional distribution of planets as a function of {s,q). However, for comparing to data. 
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one can marginalize over one or both indices, e.g., 

/ OO 

dlogsS*’; Sfi"- 

•OO 


/ <i logos’. 

J —OO 


( 4 ) 


There is one fnrther issne that has not been previonsly considered in the literatnre bnt 
is qnite relevant here. In the above-described procednres, it is implicitly assnmed that the 
observations were carried ont withont reference to the presence or absence of a planet. This 
has usnally been the case, and in the one notable example that it was not, the anthors took 
the tronble to remove the extra observations that were triggered by the presence of the planet 


(Yee et al. 2012). See also Poleski et ah (2014a). However, for S'pifzerevents that are chosen 


snbjectively, a large fraction of the nominal planet sensitivity may be dne to observations 
before the pnblic annonncement. 


However, as discnssed in Section |2.6.1[ there is the possibility that a planetary per- 
tnrbation in the early stages could trigger additional observations or selection before the 
perturbation is well-understood or even recognized, e.g. if the perturbation was mistaken 
for a rise toward high-magnihcation. To determine which hypothetical planets should be 
excluded from the planet sensitivity, we suggest that the following additional test be con¬ 
ducted for each hypothetical planet that is regarded as “detected” based on the full light 
curve: truncate the fake-data light curve at the decision date and £t only the points prior to 
this date to a point-lens model. We suggest that if < 10, then the signal from the hypo¬ 
thetical planet would be too small to trigger either follow-up or Spitzer observations. Hence, 
the hypothetical-planet detection should be accepted, but otherwise it should be rejected. 
This limit is chosen because in our experience such Ay^ < 10 deviations are extremely 
common and so cannot possibly trigger resource-expending actions. However, substantially 
higher Ay^ might well trigger an unconscious Spitzer decision announcement. 


5. Probability of Measuring Parallax 


While parallax measurements derive from a combination of ground-based and space- 
based data, the limiting factor will be Spitzer data in almost all cases. The main reason is that 
the S'pifzer observations of any given event are restricted to 38 days by Sun-angle constraint^ 
and these 38 days fall in an arbitrary part of the light curve. Second, both the cadence and 
quality of the data are very likely to be lower than the ground-based data. Finally, if 
the parallax is large enough, the event as seen from Spitzer may pass entirely outside the 
Einstein radius during the Spzfzer observations and so be effectively unmagnihed. Hence, the 


'^In practice, the spread in targets over a few degrees in the Bulge allows us to stretch the time-frame of 
the campaign to 40 days. 
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probability that tte will be measurable reduces in essence to probability that adequate Spitzer 
observations can be obtained. Hence, most of the discussion about “measuring parallax” is 
rooted in the specihc nature and procedures for Spitzer observations. However, we begin by 
briefly discussing what it means to “measure parallax”. 

5.1. Meaning of “Parallax Measnrement” 

Quantities are usually said to be “measured” if a numerical value can be assigned to 
them with some error bar and if this value is determined to be inconsistent with zero with 
some specihed level of conhdence, e.g., 3 a. For space-based microlensing parallaxes, there 
are circumstances in which upper and lower limits are sufficiently constraining, and therefore 
the dehnition of a “measurement” requires explicit discussion. 

For example, suppose that an event is observed from the ground with (Mo;^E)e = 
(0.2,10 day) and with fo,e within the window of Spitzer observations, but the Spitzer light 
curve is completely flat. Also suppose that the source is bright enough that 10% variations 
in its flux would have been detected. From these (lack of) measurements, together with the 
fact that Spitzer was ~ lAU from Earth at the time of observations, one could conclude that 
tte > 1.5, but no specihc value of tte could be assigned. Although not a “measurement” by 
traditional standards, this lower limit would be highly constraining. That is, it would imply 
that the projected velocity would be h = AU/(7rEtE) > 115kms“^, implying that the lens 
was very likely in the near disk, TTrei > 0.1 mas. Hence, if there were no planet discovered, 
the distance would be statistically well enough constrained to enter the cumulative distance 
distribution function. If the event proved to have a planet, then it is likely that p (and so 
6 'e would be measured), which would permit a strict lower limit on TTrei = ^^e^te and a strict 
upper limit on M = 6-e/ uti-e- 

At the opposite extreme, if for the same (%, Mq, tE)e, the Spitzer and Earth-based 
lightcurves appeared identical, this would be consisent with tte = 0, which in traditional 
terms might be considered as “no measurement”. However, if this consistency were quite 
tight, say tte < 0.01, then the projected velocity would be constrained v > 17,300kms“\ 
implying that the lens certainly lies in the bulge. 

Hence, the hnal sample must be dehned as events that yield true measurements or either 
upper or lower limits on tte (or both). The exact limits cannot be dehned in advance because 
there is not yet enough experience with Spitzer parallax measurements to determine what 
are reasonable limits. 


5.2. Spitzer Procedure 


Spitzer observation sequences can only be uploaded to the spacecraft once per week. 



and hence targets can only be changed on this timescale, and furthermore, the entire week 
of observations must be planned in advance. In addition, it takes several days to prepare 
the observation sequence for upload to the spacecraft even after the targets and observation 
sequence have been set. The net result is that the targets and sequence are set 3-10 days 
before the observations are actually carried out. See Figure 1 of Udalski et ah (2015). 


In light of these considerations, and to facilitate the discussion, we define several vari¬ 
ables summarized in Table First, we dehne fydec to be 6 hours prior to the time that 
observing choices must be forwarded to Spitzer operations for a given observing “week” j, 
i.e., fydec = Monday UT 15-6 hrs. Experience shows that this is the latest time that new 
information can be reliably incorporated into the observing request without risking the in¬ 
troduction of serious errors. Given the day-of-the-week constraints, the ~ 40 day campaign, 
and a start date of June 3rd, j takes on values from 1 to 7. 


The time of the first possible Spitzer observations of a given event is defined as ffirst- 
For simplicity, we will let tfirst be when those coordinates could first be observed by Spitzer, 
even if the event is not discovered until afterward this date. Finally, we define tynext as the 
time of the hrst possible observation that can be requested at fydec, and we dehne t^n as 
the hnal possible observation of an event before the Spitzer observing season ends (because 
of Sun-angle restriction and/or the end of the allocated observations). Note that fydec and 
^ynext change each week, while ffirst and Gn are dehned for each particular event (set by the 
38-day Sun-angle constraint). In general, for events selected for the hrst week of observations, 

tl,next 

5.3. Spitzer Observation Cycles 


Generally speaking, we expect most or all of the available Spitzer time to be devoted 
to this program during the ~ 40 day observing window. Hence, given continuous observing 
time, Spitzer observations in a given week can be carried out most efficiently if the targets 
are organized in concatenated “cycles” moving West-to-East through the Southern bulge 
and then East-to-West through the Northern bulge. Each event can then be given a priority 
n, which designates that it will be observed each 1/n cycles. That is, if n = 1 it will be 
observed every cycle through the Bulge, and if n = 8 it will be observed every eighth cycle. 
We expect approximately eight cycles per day, each lasting ~ 2.4 hours, with the exact 
number determined by the total observation time allotted and the total number of targets 
per cycle. This is discussed in more practical detail in Section 

5.4. Spitzer’s Role in Parallax Measurements 

As originally conceived, the standard way to measure satellite parallax was to observe 
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the peak of the light curve from the satellite and the full light curve from the ground. This 
requires only partial, but very specihc, light curve coverage from space. Therefore, one of 
the goals of Spitzer observations is to try to capture this peak in as many cases as possible. 
Although this measurement is nominally still subject to the four-fold degeneracy in tte 
( R efsdal||1966 ; Gould 1994), Gould ( |1995 ) showed that these degeneracies could be partially 
or fully broken by measuring the very small difference in as seen from the two vantage 
points, and this idea was then investigated in extensive simulations (Boutreux fc Gould|1996 


Gaudi & Gould 11997). Hence, this goal of observing the peak of the event guided the 2014 


Spitzer campaign. 

For many years it was believed that because of the four-fold degeneracy, parallax mea¬ 
surements would not be possible if the satellite observed only the rising or falling side of 
the event, but did not capture the peak. Goverage of the peak would in fact be required if 
one needed to derive independent point-lens parameters (to,uo,tE) from ground-based and 


space-based observations (as is necessary in Gould 1995 to break the degeneracy). However, 


Galchi Novati et ah (2015) showed that the four-fold degeneracy can usually be broken by a 


combination of the so-called “Rich argument” and kinematic priors derived from a Galactic 
model. Once the problem of the four-fold degeneracy is removed, the requirements on the 
satellite data are drastically reduced. 


First, tE,sat can be regarded as “essentially known,” so that it is only necessary to 
determine two satellite parameters (to,sat, Wq, sat) to measure the parallax. Of course, ^e is 
actually slightly different as seen from Earth and the satellite because they have a relative 
motion of ~ 30 kms“^ in the East direction. However, the resulting difference in is directly 
determined by tte, so while it is not strictly the case that t-^ is irrelevant, it remains true 
that only two independent light curve parameters must be derived from the satellite light 
curve. 


Second, the source flux parameter fs^sat for the satellite can be determined independently 
of the satellite light curve using a color-color relation derived from held stars combined with 


the measured color {I — H or V — I) derived from the ground-based light curve. Galchi 


Novati et ah (2015) obtained typical precisions for /^ sat of 5% in the cases for which they 


had good H or V data. However, it remains necessary to determine fb,sat from the light 
curve, which constitutes a third parameter that must be derived from the Spitzer data. 

Then, from simple parameter counting, it is in principle enough to measure three non- 


colinear points on the light curve to measure the parallax (Dong et ah 2007), e.g., one point 
that is “known” to be at baseline and two others at different magnihcations. In practice, 
more points are usually needed to have conhdence in the measurement and to have checks 
against discrete degeneracies. However, it would be enough, for example, to track the falling 
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part of the light curve from the time that the source exited the Einstein ring until it had 
dropped by 30% in magnihcation, i.e., approached baseline. 


For events that are well before peak as the Spitzer window ends, the situation is less 
straightforward because there would probably not be any baseline and the short duration 
of the observations might not yield any measurable change of slope. However, such events 
could be recovered by post-event baseline observations, either six months later (when the 
Bulge is not visible from Earth, so ordinary satellite parallax observations are not feasible) 
or the following year. Hence, ffin could be considered as a date in the distant future rather 
than the end of the current 38-day observing window. 


Therefore, there are two different channels through which parallaxes can be measured 
with Spitzer for point lens events. First, Spitzer can observe just the peak of the light 
curve. Second, Spitzer can observe either the rising or the falling side of the event plus some 
measurement of the baseline. Finally, if the event has a binary lens, features from the binary 
may be used to measure the parallax. However, this situation is more complex since binary 
perturbations last long enough that they may not be fully captured by the Spitzer data and 
so the four-fold degeneracy may persist (Zhu et ah 2015). 

5.5. Spitzer Photometric Pipeline Issues 


The feasibility of measuring tte from a given set of Spitzer observations obviously de¬ 
pends on the quality of the photometry that can be extracted from these observations. 
Remarkably, none of the wide range of publicly available bpzteer-specihc photometry pack¬ 
ages is well matched to the problem of time series of variable stars in crowded helds. As a 
result, the limits of what can be achieved from such photometry are not well understood. 

All the main elements required to solve this problem are at hand, but they have not 
so far been combined. First, the Spitzer pixel response function (PRF) is extremely well 
understood. That is, if a point source has a known flux and known position relative to the 
optical axis, then the response of all pixels can be predicted to much higher precision than 
is relevant for the relatively faint sources that are studied in microlensing experiments. The 
positions and fluxes of the great majority of sources in the microlensing helds are known 
to be constant on the timescales of the 38-day Spitzer observing window. Moreover, the 
locations of all held sources that are bright enough to be relevant are known from ground- 
based optical astrometry at much higher precision than is needed for Spitzer photometry, 
while the location of the microlensed source is typically known with even higher precision in 
this optically based frame. Even the approximate 3.6 pm huxes (other than the microlensed 
source) are known from optical V/I photometry and fairly robust local-held V/1/3.6pm 
color-color diagrams. Hence, a conceptually straightforward procedure would be to forward 
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model the ensemble of n images with one flnx parameter for each non-microlensed sonrce and 
n flnx parameters for the microlensed sonrce. Intrinsically variable stars conld be recognized 
as poor hts in this process and either ignored (if they were snfhciently far from the lensed 
sonrce) or modeled with n parameters instead of just one. 


We are working on such a pipeline, but since criteria for 2015 observations (beginning 
in June) are required several months in advance, we must assess likely Spitzer performance 
based on applying existing pipelines to 2014 Spitzer microlens data. These each contain 
some (but not all) of the advantages of the ideal pipeline outlined above. For example, the 
MOPEX pipeline fully incorporates the PRF but does not hold stellar positions constant, nor 
does it hold the flux of held stars constant. The well-known DoPhot (Schechter et ah 1993) 
pipeline can be applied to images formed by combining the six 30s dithered images at each 
epoch. It can hold stellar positions constant but does not incorporate any information about 
the PRF. We also applied a variant of the ISIS pipeline, which uses image subtraction to the 
same combined images. Although this pipeline normally outperforms DoPhot for ground- 
based microlensing data (with some exceptions), we hnd that the lack of PRF information 
generally affects ISIS more adversely than DoPhot. 


We conduct a purely empirical investigation, using 47 events from the 2014 Spitzer 
microlensing “pilot program” that have enough points to potentially construct a coherent 
light curve. We consider the photometry from MOPEX, DoPhot, and a preliminary version 
of our own pipeline. We create an optically-based effective Spitzer (“L-band”) magnitude 
(since prior to obtaining Spitzer data we have no independent knowledge of the true Spitzer 
flux). This is dehned 


Leff = / - O. 93 A 7 - 1.3 -7 0.50(1 -Ai- 17.2), (5) 

where A/ is the extinction in J-band ( Nataf et al.||2013 ) and 0 is the Heaviside step function. 
We stress that no precise physical meaning should be attached to Leg- It is simply an 
approximate predictor of the Spitzer flux based on optical data. The 0 function divides 
all stars into two types: turnoff stars (0 = 1) and low-luminosity giants (0 = 0). The 
justihcation for this approximation is that signihcantly fainter (and redder) dwarfs generally 
will not enter our sample and signihcantly brighter (and redder) giants are very rare. Of 
course, by limiting ourselves to two classes of stars we are still ignoring evolution over the 
sub-giant branch. However, in the general case, it is not possible to make a hner distinction, 
particularly before a detailed investigation of an individual event has been made, as is almost 
always the case when one must make the decision about whether to monitor a particular 
event. 


Using this proxy, we hnd that it is usually not possible to obtain good photometry 
with existing software unless there is at least one point with Lgff < 15.5. We therefore use 
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this criterion as our principal guideline for deciding whether parallaxes can be measured for 
particular events. This may appear too conservative in that there will almost certainly be 
photometry improvements by the time that the data are analyzed. On the other hand, when 
making decisions about Spitzer observations, one must use the simplihed “assumption” that 
Spitzer will see the same brightness source star as it would if it were observing from Earth 
because the true magnihcation as seen from Spitzer is unknown. That is, the whole point of 
the experiment is that the Spitzer and Earth-based light curves will differ by an intrinsically 
unpredictable amount. In particular, the source could be less magnihed as seen from Spitzer 
than from the ground. Thus, we adopt Lefr < 15.5 as a good balance between these two 
considerations. 

An additional consideration, given that the IRAC pixels are 1.2", is that the target 
may be blended with other stars in the crowded held, which can affect the quality of the 
photometry. The severity of this blending depends both on the separation of the blend from 
the target and on their relative huxes at 3.6/xm. While the separation of potential blends 
can generally be determined from existing, higher-resolution, ground-based data, the relative 
huxes cannot. Furthermore, in part because of the problems with the photometric pipelines, 
we have not been able to determine exactly what criteria can be used to assess whether or 
not a given blend star will cause a problem for the photometry. Hence, while we are aware 
of this issue, it is not possible to account for it at the present time. 

6. Objective Criteria 


As discussed in Section 2.1, there is an extremely strong reason for choosing as many 
events as possible based on purely objective criteria: all planets (and planet sensitivity) from 
the entire event can be included in the sample. However, there is also a huge potential for 
wasted Spitzer observations if these criteria are not sufficiently restrictive. Hence, we have 
opted for a conservative approach. 


An important point to keep in mind is that for events in low-cadence helds, there is 
no major advantage to selecting the event objectively because such events have very little 
sensitivity to planets in the absence of follow-up data (Section |^. Their sensitivity will 
only be substantial if higher-cadence (usually ground-based) observations are triggered. If 
this recognition also triggers Spitzer observations (or rather, commitment to such observa¬ 
tions) at the same time, then essentially no planet sensitivity is lost. Column 1 of Figure 
demonstrates that these low-cadence events are also extremely hard to predict. 


Another point to keep in mind is that it is substantially easier to predict the future 
course of events that have already peaked than those that are still rising (compare panels le 
and 2e to earlier panels in Figure]^, and hence to estimate accurately whether a successful 
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Fig. 2.— Flowchart illustrating the process for objectively selected events. An event may be 
objectively selected either before or after the peak, but must meet all of the selection criteria 
for that category. 
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parallax measurement can be made. This fact is especially important for events that have 
peaked before the Spitzer campaign has begun. For events that peak during the campaign, 
the probability of measuring tte can be substantially enhanced if Spitzer observations are 
made over peak, i.e., before such secure information about the event parameters is available. 

Guided by these considerations (and others related to subjective selection that are dis¬ 
cussed below), J.C.Y. and A.G. developed some preliminary objective criteria, and then 
(independent of these criteria) each individually analyzed 242 events based on OGLE and 
MOA data obtained up through June 3, 2013, which is the analogous time to the hrst de¬ 
cision time in 2015. These events had been pre-selected based on very loose criteria from 
about 1000 events that had been found by these collaborations by this date. For each event, 
they decided whether it should be chosen for hypothetical Spitzer observations to begin three 
days later, and if so at what cadence. All disagreements were discussed and the final joint 
decisions were subsequently evaluated based on comparison to the full 2013 light curves. The 
agreed-upon sample contained all nine events that were selected by the preliminary objective 
criteria, and also many that were not for a total of 44 events. Based on this detailed anal¬ 
ysis J.G.Y. and A.G. refined the objective criteria for selection and the objective cadence 
choice, both of which are listed immediately below, and also developed general guidelines to 
subjectively choose events, which are discussed in Section 

Figure [^summarizes the process for objectively selecting events. In this scheme, events 
may be divided into two categories: events that have already peaked (f = fo + 2 days) and 
events “before” the peak (f < to + 2 days). In order to be objectively selected, the event 
must meet all criteria for the appropriate category. We begin by discussing events that have 
peaked, because they are generally better understood (i.e. the model fits have converged 
and their future behavior is well-constrained). 

6.1. Events that have Already Peaked 

Al) to + 2 days < fj^dec 
A2) Either 

a) in an OGLE field w/ cadence > lOday”^, or 

b) in a “core” KMTNet held, or 

c) Amax > 3 and in an OGLE held w/ cadence = 3day“^ 

A3) Js-2.51og(A^ax) < 17 

A4) A(fj^next) ~ A(ffin) >0.3 
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A5) Ls,eff - 2.5 log [A (tj-next)] < 15.5 
A6) Ls,eS - 2.5 log[A(tj- next) - 2l(tfin)] < 17 


Criterion (Al) is simply a practical definition of “post-peak”. Criterion (A2) selects 
for events that have signihcant planet sensitivity. For events in high-cadence helds this is 
essentially any event (provided it meets the other criteria) because planets can be discovered 
in these events far out into the wings and even at baseline. For other, somewhat lower- 
cadence events, the criterion demands A^ax > 3 as a minimum indicator that the Spitzer 
parallax observations will be worthwhile. However, for events in low-cadence survey helds, it 
is essential to carry out ground-based follow-up observations to gain substantial planet sen¬ 
sitivity. Hence, for events in low-cadence helds, there is no triggering of Spitzer observations 
via objective criteria, and it will therefore be necessary to subjectively select these events 
prior to peak (which may be before the Spzteer observing window). 

Criterion (A3) demands that the peak hux from the magnihed source (not including 
any blended light) be / < 17. It increases the likelihood that the ground-based photometric 
precision will enable good planet sensitivity. To date, the overwhelming majority of planetary 
microlensing events have peaked / < 17. In addition, this criterion is important not only to 
secure an accurate ht to the light curve but to permit application of color-color relations to 
determine the Spitzer source hux, both of which ahect the hnal parallax measurement. 


Criterion (A4) is driven by the fact that parallax measurements require a well-measured 
change in magnihcation as seen from Spitzer. In practice, this means both measuring a 
hux change and independently determining the Spitzer source hux. For observations that 
begin past peak, it is impossible to reliably ht the Spitzer light curve for the source hux, so 
it must be determined from color-color relations, which can be done reliably to about 5%. 
Hence, we require at least a 0.3 change in magnihcation (i.e., 6-fold larger than 5%) based 
on our estimate of what will be required for reliable parallax measurements (Section 5.4). 
This criterion is equivalent to demanding that the source is still in the Einstein ring (at the 
time of the next possible observation) for the case that the hnal possible observation is well 
outside the Einstein ring. 


Criterion (A5) derives directly from the difficulty of extracting Spitzer photometry un¬ 
less at least one point is brighter than < 15.5 (Section |5.5[ ). Criterion (A6) demands a 
minimal hux change in Spitzer hux. It will be automatically satished for the great major¬ 
ity of stars that satisfy criterion (A5) and is included to guard against including (at least 
automatically) events that are not predicted to change much over the remainder of the obser¬ 
vations. As noted in the justihcation for criterion (A3), measurable hux changes are crucial 
for parallax measurement. Criterion (A6) ensures that not only the magnihcation changes. 
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but the flux itself changes by a significant amount. 


6.2. Rising Events 


Bl) to > tdec - 2 days 

B2) in OGLE held w/ cadence > 3day“^ or in a “core” KMTNet held 

B3) /now < 17.5 

B4) /base /now >0.3 

B5) LefF,dwarf,now /now 0.93^4/ 0.8 < 15.5 

Criterion (Bl) is just the practical definition of a rising eventj^ (i.e., the complement of 
criterion (Al). 

Criterion (B2) restricts this entire class of objective selection to fields with moderate-to- 
high cadence, > 3day“^, to ensure that the selected events have good sensitivity to planets 
over the entire light curve. The primary concern is that once an event has met the full set 
of rather restrictive criteria, it will be close to the peak and a large fraction of the event will 
be over. As discussed in Section without substantial follow-up data, low-cadence events 
have very little practical planet sensitivity. Hence, it is not worthwhile to observe them with 
Spitzer to try to measure a parallax because they will add very little to the final analysis 
of the planet occurrence rate as a function of Galactic distance. We note that most high- 
quality events that fail this criterion can be selected subjectively (Section]^ well before peak 
with the specific goal of triggering additional follow-up observations to improve the planet 
sensitivity. 

The remaining three criteria make no reference to a light curve model and instead rely 
on purely empirical observables. Again, this is because experience shows that such models 
are not reliable for pre-peak light curves. All three make reference to “/now” which is the 
last measured OGLE point. 

Griterion (B3) assures that the event will be bright enough for accurate measurements 
(necessary for both planet sensitivity and parallax). Griterion (B4) assures that even if the 


^Technically, this also includes events < 2 days past peak. The reason for this choice is that it can be 
difficult to be confident that an event has indeed peaked unless there are data after the peak to demonstrate 
this explicitly. 
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source is not blended, the event has risen at least 32% above baseline, i.e., the source must 
be (nearly) inside the Einstein ring. The combination of (B3) and (B4) ensure that even 
if the source turns out to be heavily blended, then at least the magnified flux will change 
signihcantly compared to its present value relative to a future baseline measurement, which 
will enable a measurement of the parallax. Finally, criterion (B5) attempts to assure that 
there will be at least one point above the photometric threshold for measuring a Spitzer light 
curve (Section |5.5[ ). This is only “attempted” (rather than guaranteed) since the event may 
not be as magnihed from Spitzer’s vantage as from Earth. Because the source magnitude is 
most likely not known at the time of this algorithmic selection, we conservatively assume it 
is a dwarf (i.e., relatively blue and so fainter as seen by Spitzer for hxed /-band brightness). 


6.3. Objectively Determined Cadences 


As we have discussed, events that are selected objectively must have objectively deter¬ 
mined cadences. In practice, cadences are actually defined by “priorities”, where priority n 
means that the event is observed during 1/n of the cycles through the microlensing helds 
(Section 5.3). However, we state these here in terms of cadences, since there is a clear- 
cut conversion from one to the other once the target sample is selected. We designate the 
following algorithm for setting the observation cadence for Week j 


Cl) Default cadence: 1 day“^ 

C2) 2day~^ provided that all of the following are true 

(a) This is hrst Spitzer observation period of the event, and 

(b) tj YiQxi > fo) snd 

(c) A(fj-next) < 1-35 

C3) 2 day~^ for events beginning the when less than two full weeks remain 
C4) Stop observing the event, provided that all of the following are true 

(a) At least 2 weeks of objectively-determined observations are complete, and 

(b) tj,next > to + tE 

(c) tj+l^next ^ ^0 T 

C5) Stop observing the event, if either 
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(a) The parallax of an event has been measnred from Spitzer data already collected, 
or 

(b) The Spitzer light curve has already reached baseline (so no more parallax infor¬ 
mation could be extracted from additional observations) 


Criterion (Cl) has been shown by Calchi Novati et ah (2015) to be generally adequate 
to make parallax measurements. However, for events that are leaving the Einstein ring as 
the Spitzer observations begin (C2) or for which there is only a short rising observational 
sequence at the end of the Spitzer window (C3), the cadence is doubled. These events 
have signihcantly more restricted light curve coverage than the typical events analyzed by 
Calchi Novati et ah ( |2015 ) and therefore require higher cadence to obtain more points (so 
higher signal-to-noise ratio) while the event is still signihcantly magnihed. (C4) imposes 
a reasonably conservative criterion for halting observations. In principle, this may cause 
Spitzer to miss a key portion of the light curve because it can in principle peak either earlier 
or later as seen from Spitzer than the Earth-based light curve would predict. However, 
because bulge lenses have small microlens parallaxes, the light curve peaks as seen from 
Earth and Spitzer are very close in time. On the other hand, for disk lenses, the Spitzer peak 
is usually earlier (or not much later) than from the ground because these disk lenses tend 
to move in the direction of Galactic rotation, i.e., about 30° East of North, whereas Spitzer 
is roughly due West of Earth. See Figure 2 of Calchi Novati et al. (2015). Finally, (C5) 
provides more specihc conditions for halting observations if the Spitzer data can be reduced 
and analyzed in real-time. 


6.4. Binary Events 


For completeness, we also specify the objective selection criteria for binary events. Un¬ 
like planetary events, binary events show prominent anomalies that modify the single-lens 
light curve signihcantly. Therefore, most binary events can be recognized in advance, and 
the inability to model them with single-lens light curves makes all selection criteria based on 
single-lens modeling (i.e.. Sections 6.1 and 6.2) fail in most cases. As a consequence, in the 
2014 pilot program we subjectively selected binary events, such as OGLE-2014-BLG-1050 
(Zhu et ah]|2015), for observations because their nature as binary events had been conhrmed. 


Therefore, in order to enable statistical studies of stellar binaries one has to have objec¬ 
tive selection criteria. After reviewing those binary events from the 2014 season, we decide 
to use the following criteria and cadence: 


1. Begin Spitzer observations if 
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(a) The ground-based light curve is in a U-shaped trough, and 

(b) trough ^ 16. 

2. End Spitzer observations 

(a) Either: 

i. One full week after the Spitzer light curve exits the caustic, or 

ii. Both: 

A. One full week has passed since the ground-based light curve exits the 
caustic, and 

B. The Spitzer light curve is shown never to have entered the caustic. 

3. Default cadence of 1 day“^. 


7. Guidelines for Subjectively Chosen Events 


We have already discussed in Section [272] the various reasons one might want to subjec¬ 
tively select an event and how that might affect the type of subjective selection (Table [^. 
Figure [^summarizes the decision process that can lead to subjective selection of an event. 


The most straightforward type of subjective selection is “Subjective, immediate” in 
which an event is immediately selected for observations, committed to, and has its cadence 
specihed. Such a decision may be made at any time, including before the start of Spitzer 
observations. There are two primary types of events that might be selected this way. First, 
events that are discovered and well understood before fj^dec but might not meet the objective 
criteria. Committing to observations immediately allows more of the planet sensitivity of 
that event to be captured. The second type of event is a well-understood, low-cadence event 
that requires follow-up observations in order to capture signihcant planet sensitivity. 


However, we expect that the great majority of events will be selected under the categories 
“Subjective, conditional” or “Subjective, secret” because at time t^^dec there is no way to 
distinguish between events that will reach moderate or higher magnihcation (Amax > 3) 
sometime before the start of the next cycle of Spitzer observations, fj+i,next (i-e., fydec + 
10 day), and events that will turn over at low magnihcation and low hux levels (e.g. compare 
the two hts in panel lb of Figure]^. Moreover, even among those that are likely to achieve 
satisfactory magnihcation, it cannot be decided automatically whether suitable follow-up 
resources can be allocated to a specihc event, given a wide variety of operational constraints. 
And hnally, it may be impossible to determine which of these events will get to high or 
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Fig. 3.— Flowchart illustrating the process for subjectively selecting an event. By dehnition, 
this is less quantitative than the process for objectively selecting an event (see Figure [^, 
but the underlying considerations are the same in both cases, namely: “Does the event have 
good planet sensitivity?” and “Is this event likely to yield a Spitzer parallax?”. 
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very high magnihcation based on routine survey data. However, additional investigation, 
including additional follow-up data and/or color information from survey and/or follow-up 
observations may resolve this question. 

As we have already discussed above, subjective decisions may be made before or after 
ffirst; but they must specify the cadence (or cadence algorithm) at the time of commitment. 
To be robust, any algorithm must be based on readily available data, such as the /-band 
light curve and the /-band held extinction. 

Given the above factors, together with the fact that planet sensitivity is heavily skewed 
toward higher-magnihcation events, it is inevitable that Spitzer observations will be triggered 
for a large number of events of uncertain prospects, and therefore that the majority of these 
must be terminated promptly after the event fails to rise to the level that permits signihcant 
planet sensitivity and also permits its parallax to be measured. Otherwise, a substantial 
fraction of observing time will be wasted on useless events. 

Hence, the principal issue will be deciding how to frame this failure in either of the two 
cases (“Subjective, conditional” or “Subjective, secret”). This will have to be done on a case- 
by-case basis at the time the events are subjectively chosen, since the uncertain nature of 
such events makes it impossible to develop strict guidelines in advance. In the hrst case, this 
framework must provide a guide to terminating observations (e.g., “stop observations next 
week if event does not reach I < 16.2 by the next decision point, tj+i,dec”)- second, 

it must provide a guide as to whether or not the team should commit to observations of the 
event. 

To the extent possible, the criteria governing such terminations (or continued observa¬ 
tions) should be framed in terms of post-peak observables because these are more robust. 
They should then be formulated as proxies for the criteria (Al)-(A6), but with fynext —>■ to- 
However, this implicitly assumes that fydec < tg < fj+i,dec (essentially criterion (Al)). It 
will in general be necessary to specify what should be done if the event does not ultimately 
satisfy that criterion. For example, it could be stated that in this case, the observations 
continue at the same cadence and the decision is made during the next week. 

8. Objective Allocation of Remaining Observing Time 
8.1. General Considerations 


An important aim of the Fpzteer observations, as outlined in the proposal (Gould et al. 


2014) is to detect and characterize planets from Spitzer observations themselves. Because 


planet sensitivity scales with magnification, this can best be done by monitoring higher- 
magnification events more intensively from Spitzer, in particular events that are at higher 
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magnification as seen by Spitzer. Hence, after the fundamental goal is met by allocating 
enough observation time to reliably measure parallaxes (as outlined in Section and and 
also to obtaining parallaxes of microlens binaries (which was also part of the proposal but is 
not the subject of the present study), the remaining time should be allocated to this purpose. 

Because these additional planet-hnding observations involve allocation of additional time 
to the same set of events that are the object of microlens parallax measurements, it is im¬ 
portant to isolate the decisions about this allocation from the prospect of improving parallax 
measurements. Otherwise, events with known planets could receive additional measurements 
aimed at measuring parallax, making them more likely to have good parallax measurements 
relative to those that failed to show any planets. And since only events with good parallax 
measurements can be included in the sample to measure the Galactic distribution of planets, 
we must eliminate the potential for bias. 

At the same time, it is difficult to develop completely objective criteria for these alloca¬ 
tions because of the wide range of the possible quantities of available time and wide range 
of the possible quality of events to which they might be applied. In addition, the amount of 
additional time will depend on the precise number of events being monitored. Finally, the 
targets for a given week will have a wide range of magnihcations, whose distribution cannot 
be predicted in advance. 


8.2. Practical Execution 


We propose the following algorithm to effectively separate these decisions while leaving 
adequate freedom to respond to potential planet sensitivity by obtaining additional Spitzer 
observations. 


Recall from Section 5.3 that the most efficient way to observe these events is to cycle 
through the Bulge West-to-East and East-to-West. The priority of an event sets how often 
an event will be observed. Specihcally, priority n = 1 is observed every cycle whereas priority 
n = 8 is observed every eighth cycle. The exact number of cycles depends on the the number 
of events, their priorities, and the amount of time available. 


All events that have been chosen for Spitzer observations, except newly selected subjec¬ 
tive events are rank ordered according to the 2 a lower limit of their highest magnihcation 
(Aj^max) in the observing window [tynext, tj+i.next]- These are then assigned priorities that 
map monotonically to this ranking. The break points in this mapping are decided manually. 
For example, Aymax > 20 ^ n = 1, 10 < Aymax < 20 ^ n = 2, 5 < Aymax < 10 u = 3, 
3 < Aymax < 5 ^ u = 4. The only other rule is that the priorities and the number of events 
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in each category result in a total number of observations that equals the number of observa¬ 
tions available. Hence, there are likely many choices of break points and priorities that fulhll 
these two criteria (monotonic mapping and total observations) for any given rank-ordered 
set of events and amount of observing time. The final choice, which will set the number of 
cycles and their duration, is at the discretion of the Spitzer team. 

While the manual decision on break points might seem to allow skewing (conscious 
or unconscious) of the strength of different parallax measurements, in fact this is virtually 
impossible. The events whose parallax is poorly measured with existing data, and therefore 
might require “saving” with additional observations, will be at low magnification and hence 
cannot be helped by any manual decision that is constrained by the monotonic mapping 
described above. 

Here, the default meaning of “highest magnification” is highest as predicted for obser¬ 
vations from the ground. However, it may be possible to download and process Spitzer data 
sufficiently quickly to make predictions about the course of the event as seen from Spitzer. 
In this case, the Spitzer magnifications should take precedence. 

Why do we exclude the subjectively chosen events whose observations are just start¬ 
ing? There are two reasons. First, the future course of these events is often very poorly 
understood, so the 2 a lower limit on the magnification is likely to be very low and hence 
unlikely to trigger the additional observations being considered here. More fundamentally, 
the cadence of these first-week observations is subjectively decided, so no objective procedure 
is required to allocate additional observations to planet hunting in these events if the team 
decides that is necessary. 

The cadence (or algorithm for determining the cadence) of Spitzer observations must be 
specified at the time of the commitment to observe a subjectively selected event. However, 
in practice these cadences will almost always be set at tsei, since even for “subjective, secret” 
events tcom will generally happen before the next decision point when the cadence could be 
changed (t^+i^dec)- In subsequent weeks, any change in the cadence of these events must be 
through the process described above. 


9. Conclusions 

We have outlined an approach for maximizing the planet sensitivity of space-based 
microlens parallax surveys, for the particular case that the satellite targets are chosen based 
on ground-based identification of events. This applies to current observations by Spitzer and 
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any narrow field-of-view, targeted observations from spac^ 

The basic principles are: 

First, objective criteria are qnite easy to establish for events that have already peaked 
becanse their hts are well constrained. Becanse the criteria are objective, the entire time 
span of those events, with respect to both planets and planet sensitivity, can be inclnded in 
the analysis. This inclndes, in particnlar, all of the time before peak, which lies before the 
onset of Spitzer observations and also before the event was even recognized as microlensing. 

Second, it is also possible to establish objective criteria for a snbset of pre-peak events. 
However, becanse these events are pre-peak, their model hts may not be reliable, so it is 
necessary to dehne these criteria in terms of observables. 

Third, for objectively chosen events, cadences must also be determined by objective 
criteria. 

Fourth, for those events in low-cadence survey helds, it is less important to dehne 
objective criteria, because the events have low sensitivity to planets unless additional follow¬ 
up observations are obtained. 

Fifth, the remaining events, including those in low-cadence helds, can be chosen subjec¬ 
tively, but the full cadence (or prescription for determining the cadence) must be specihed 
at the time that they are selected. Subjective selection can take several forms, but the most 
important aspect is when a commitment is made (and announced) to observe the event. It 
is this date that determines what planet sensitivity and planet detections are included in the 
analysis. 

Sixth, in the case of events that were previously chosen on subjective grounds and 
that subsequently meet the objective criteria, their objective status must take precedence 
in evaluating the event as part of the sample. This assumes that a parallax measurement 
proved possible based only on the (more restricted subset of) objectively required Spitzer 
observations. If not, and if the full set of Spitzer observations yields a parallax measurement, 
then they revert to subjective status. 

This paper constitutes a public announcement of our objective criteria and procedures. 
If there are any updates to these, they will be posted on arXiv as a revision to or update of 
this paper. 


® It does not apply to a microlens survey by Kepler (K2) because the field will be a large, pre-selected 
region rather than having the observations targeted at specific, known microlensing events. 
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